Server-processor hybrid system for processing data

ABSTRACT

The present invention relates to a server-processor hybrid system that comprises (among other things) a set (one or more) of front-end servers (e.g., mainframes) and a set of back-end application optimized processors. Moreover, implementations of the invention provide a server and processor hybrid system and method for distributing and managing the execution of applications at a fine-grained level via an I/O-connected hybrid system. This method allows one system to be used to manage and control the system functions, and one or more other systems to co-processor.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is related in some aspects to commonly owned and co-pending patent application number (to be provided), entitled “PROCESSOR-SERVER HYBRID SYSTEM FOR PROCESSING DATA”, assigned attorney docket number END920070376US1, filed Nov. 15, 2007, the entire contents of which are herein incorporated by reference. This application is related in some aspects to commonly owned and co-pending patent application Ser. No. 11/877,926, entitled “HIGH BANDWIDTH IMAGE PROCESSING SYSTEM”, assigned attorney docket number END920070398US1, filed Oct. 24, 2007, the entire contents of which are herein incorporated by reference. This application is related in some aspects to commonly owned and co-pending patent application Ser. No. 11/767,728, entitled “HYBRID IMAGE PROCESSING SYSTEM”, assigned attorney docket number END920070110US2, filed Jun. 25, 2007, the entire contents of which are herein incorporated by reference. This application is also related in some aspects to commonly owned and co-pending patent application Ser. No. 11/738,723, entitled “HETEROGENEOUS IMAGE PROCESSING SYSTEM”, assigned attorney docket number END920070110US1, filed Apr. 23, 2007, the entire contents of which are herein incorporated by reference. This application is also related in some aspects to commonly owned and co-pending patent application Ser. No. 11/738,711, entitled “HETEROGENEOUS IMAGE PROCESSING SYSTEM”, assigned attorney docket number END920070111US1, filed Apr. 23, 2007, the entire contents of which are herein incorporated by reference.

FIELD OF THE INVENTION

The present invention generally relates to data processing. Specifically, the present invention relates to a server-processor hybrid system for more efficient data processing.

BACKGROUND OF THE INVENTION

Web 1.0 is historically referred to as the World Wide Web, which was originally about connecting computers and making technology more efficient for computers. Web 2.0/3.0 is considered to encompass the communities and social networks that build contextual relationships and facilitates and knowledge sharing and virtual web servicing. Traditional web service can be thought of as a very thin client. That is, a browser displays images relayed by a server, and every significant user action is communicated to the front-end server for processing. Web 2.0 is a social interaction that is consisted of the software layer on the client, so the user gets quick system response. The back-end storage and retrieval of data is conducted asynchronously in the background, so the user doesn't have to wait for the network. Web 3.0 is geared towards the 3 dimensional vision such as in virtual universes. This could open up new ways to connect and collaborate using 3D shared. Along these lines, web 3.0 describes the evolution of Web usage and interaction along several separate paths. These include transforming the Web into a database, a move towards making content accessible by multiple non-browser applications.

Unfortunately, the traditional server cannot efficiently handle the characteristics of Web 3.0. No existing approach addresses this issue. In view of the foregoing, there exists a need for an approach that solves this deficiency.

SUMMARY OF THE INVENTION

The present invention relates to a server-processor hybrid system that comprises (among other things) a set (one or more) of front-end servers (e.g., mainframes) and a set of back-end application optimized processors. Moreover, implementations of the invention provide a server and processor hybrid system and method for distributing and managing the execution of applications at a fine-grained level via an I/O-connected hybrid system. This method allows one system to be used to manage and control the system functions, and one or more other systems to serve as a co-processor or accelerator for server functions.

The present invention allows the front-end server management and control system components to be reused, and the applications such as virtual web or game processing components to be used as an accelerator or co-processor. The system components can be run using different operating systems. The front-end server(s) acts as a normal transaction based computing resource, but for which these transactions may be the trigger for large computationally intense graphics, rendering or other functions. The processor is placed at the back-end to handle such functions. In addition to traditional transaction processing, the front-end server(s) would also perform specific processor selection functions, and set-up, control and management functions of the cell co-processors.

A first aspect of the present invention provides a server-processor hybrid system for processing data, comprising: a set of front-end servers for receiving the data from an external source; a set of back-end application optimized processors for receiving the data from the set of front-end servers, for processing the data, and for returning processed data to the set of front-end servers; and an interface within at least one of the set of front-end servers having a set network interconnects, the interface connecting the set of front-end servers with the set of back-end application optimized processors.

A second aspect of the present invention provides a method for processing data, comprising: receiving the data from an external source on a front-end server; sending the data from the front-end server to a back-end application optimized processor via an interface having a set of network interconnects, the interface being embodied in server; processing the data on the back-end application optimized processor to yield processed data; and receiving the processed data from the back-end application optimized processor on the front-end server.

A third aspect of the present invention provides a method for deploying a server-processor hybrid system for processing data, comprising: providing a computer infrastructure being operable to: receive the data from an external source on a front-end server; send the data from the front-end server to a back-end application optimized via an interface having a network interconnects, the interface being embodied in the front-end server; process the data on the back-end application optimized processor to yield processed data; and receive the processed data from the back-end application optimized processor on the front-end server.

BRIEF DESCRIPTION OF THE DRAWINGS

These and other features of this invention will be more readily understood from the following detailed description of the various aspects of the invention taken in conjunction with the accompanying drawings in which:

FIG. 1 shows box diagram depicting the components of the server-processor hybrid system according to the present invention.

FIG. 2A shows a more detailed diagram of the system of FIG. 1 according to the present invention.

FIG. 2B shows a more specific diagram of the back-end application optimized processors(s) of the hybrid system according to the present invention.

FIG. 3 shows communication flow within the server-processor hybrid system according to the present invention.

FIGS. 4A-D shows a method flow diagram according to the present invention.

The drawings are not necessarily to scale. The drawings are merely schematic representations, not intended to portray specific parameters of the invention. The drawings are intended to depict only typical embodiments of the invention, and therefore should not be considered as limiting the scope of the invention. In the drawings, like numbering represents like elements.

DETAILED DESCRIPTION OF THE INVENTION

As indicated above, the present invention relates to a server-processor hybrid system that comprises (among other things) a set (one or more) of front-end servers (e.g., mainframes) and a set of back-end application optimized processors. Moreover, implementations of the invention provide a server and processor hybrid system and method for distributing and managing the execution of applications at a fine-grained level via an I/O-connected hybrid system. This method allows one system to be used to manage and control the system functions, and one or more other systems to serves as a co-processor or accelerator.

The present invention allows the front-end server management and control system components to be reused, and the applications such as virtual web or game processing components to be used as an accelerator or co-processor. The system components can be run using different operating systems. The front-end server(s) acts as a normal transaction based computing resource, but for which these transactions may be the trigger for large computationally intense graphics, rendering, numerically intensive computation or other functions. The processor is placed at the back-end to handle such functions. In addition to traditional transaction processing, the front-end server(s) would also perform specific processor selection functions, and set-up, control and management functions of the co-processors. It should be understood that as used herein, the term data can mean any type of data such as multi-modal” data—(video/data) streams, predefined block lengths, text, graphics, picture formats, xml, html etc. Having the server at the front-end of hybrid system provides (among other things) good at high transaction throughput, cell processor workload management, cell processor resource management.

Referring now to FIG. 1 a logical diagram according to the present invention is shown. In general, the present invention provides a server-processor hybrid system 11 that comprises a set (one or more) front-end servers 12 and a set of back-end application optimized processors (referred to herein as processors for brevity) 20. As shown, each server 12 typically includes infrastructure 14 (e.g., email, spam, firewall, security, etc.), a web content server 16, and portal/front-end 18 (e.g., an interface as will be further described below). Applications 19 and databases 18 are also hosted on these servers. Along these lines server(s) 12 are typically System z servers that are commercially available from IBM Corp. of Armonk, N.Y. (System z and related terms are trademarks of IBM Corp. in the United States and/or other countries). Each processor 20 typically includes one or more application function accelerators 22, and one or more database function accelerators 24. Along those lines, processor(s) 20 are typically cell blades that are commercially available from IBM Corp. (cell, cell blade and related terms are trademarks of IBM Corp in the United States and/or other countries). Moreover, processor(s) 20 are optimized based on the applications 19 that are running so that performance is optimal and efficient. As shown, server 12 receives data from an external source 10 via typically communication methods (e.g., LAN, WLAN, etc.). Such data is communicated to processor 20 for processing via an interface of server 12. Processed data can then be stored and/or returned to server 12 and onto external source 10. As depicted, server 12 represents the front-end of hybrid system 11 while processor 20 represents the back-end.

This system is further shown in FIGS. 2A-B. FIG. 2A shows external source(s) 10 communicating server(s) 12, which communicates with processor(s) 20 via interface 23. Typically, interface 23 is an input/output (I/O) cage embodied/contained within each server 12. Interface 23 also includes a set of network interconnects such as express peripheral component interconnects (PCIs) 25. Interface 23 may also include other components as indicated in the above-incorporated patent applications.

In any event, data will be received from external source(s) 10 on processor(s) 20 and communicated to server(s) via interface(s) 23. Once received, processor(s) 20 can will process the data, return the processed data to server(s) 12, which can return the same to external source(s) 10. Processor(s) 20 can also leverage staging storage and processed data storage devices to store the original data and/or the processed data. As shown in FIG. 2B, each processor 20 typically includes a power processing element (PPE) 30, an element interconnect bus (EIB) 32 coupled to the PPE, and a set (e.g., one or more) but typically a plurality of special purpose engines (SPEs) 34. The SPEs share the load for processing the data.

Referring briefly to FIG. 3, a more specific diagram showing the components' placement within hybrid system 11 is shown. As depicted, server(s) 12 receive/send data from external sources A and B, and route that data to processor(s) 20 for processing. After such processing, processed data is returned to server(s) 12, and to external sources A and B. Also present in hybrid system 11 is staging storage device 36 and processed data storage device 38. Staging storage device 36 can be used to store data prior to, during and/or after being processed, while processed data storage device can be used to store processed data.

Referring now to FIGS. 4A-D, a flow diagram of an illustrative process according to the present invention will be described. For brevity purposes (for the remainder of the Detailed Description of the Invention, server 12 of is referred to as “S”, while processor 20 is referred to as “C”. In step S1 external source (A) makes a connection request. In step S2, the connection request is passed on to C after validation by server S. In step S3, C accepts connection, S informs A of connection setup completeness. In step S4 stream P arrives from A at server S. S performs P′=T(P) where T is a transformation function on stream P. In step S5, S can save the data in storage and/or pass it to another device. In step S6, Output bytes are continuously passed onto C. In step S7, C performs P″=U(P′) where U is transformation function performed by C. In step S8, P″ is routed to back to S. S performs P³⁼V(P″) where V is a transformation function performed by server S in step S9. In step S10, P³ is routed continuously to B or A. Additionally, in step S10, A presents connection termination packet (E). In step S11, S receives E and in S12 S inspects E. In step S13, it is determined that E is a connection termination packet. In step S14, input sampling and computation stops. In step S15, S informs C of stream completion. In step S16, C stops computation. In step S17, C informs S of computation termination. In step S18, S informs B of connection termination. In step S19, S acknowledges A of computation completion.

Although not separately shown in a diagram the following is an example of another control flow made possible under the present invention. This control flow is useful in scenarios where requests are made directly by S to C without data being sourced from A or redirected to B.

-   1. S makes connection request -   2. Is the connection request valid? (performed by C) -   3. If yes, accepted by C -   4. Stream P arrives from S at Processor C (P can also just be     “block” input with a predefined length) -   5. C performs T(P) where T is a transformation function on stream P -   6. T(P) Output bytes are continuously passed back to S -   7. S encounters End-of-File or End of Stream -   8. S presents connection termination packet (E) -   9. C inspects E -   10. Is E a connection termination packet? -   11. If Yes, stop sampling inputs, stop computation on C -   12. C acknowledges S on computation termination

Under the present invention, both a push model and a pull model can be used. Control messages can be sent across a separate control path with data messages being sent over the regular data path. Here two separate connection IDs are needed. Control messages can also be sent along with data messages across the same path. In this case, only one connection ID is needed. Both Push and Pull models can be realized for separate or unified data path and control path. The push model is useful for short data where latency is a concern. Control messages usually have latency bounds for data transfer. This requires engagement of the data source computer processor until all the data is pushed out. The pull model is usually useful for bulk data where the destination computer can read data directly from the source's memory without involving the source's processor. Here the latency of communicating the location and size of the data from the source to the destination can easily be amortized over the whole data transfer. In a preferred embodiment of this invention, push and pull model can be invoked selectively depending on the length of data to be exchanged.

The following steps show how the push and pull models works:

Dynamic Model Selection

-   -   (1) S and C want to communicate. Sender (S or C) makes the         following decisions

-   Step 1—Is data of predefined length and less than Push Threshold     (PT)?

-   Step 2—If yes, then employ “push”

-   Step 3—if no, then data is of streaming nature without any known     size. S “shoulder taps” C without location address of data. Push     Threshold (PT) is a parameter that can be chosen for a given     application or data type (fixed length or stream) by the designer of     the system.

-   Push Model

S shoulder taps C with data block size (if known).

S looks up application communication rate requirements (R).

S looks up # of links in “link aggregation pool” (N).

S matches R and N by expanding or shrinking N [dynamic allocation].

S and C agree on number of links required for data transfer

S pushes data to C.

S can close connection in the following ways—when all data is sent (size known) & when job is complete.

S closes connection by shoulder tap to C.

-   Pull Model -   S shoulder taps C with data block size (if known). -   S looks up application communication rate requirements (R). -   S looks up # of links in “link aggregation pool” (N). -   S matches R and N by expanding or shrinking N [dynamic allocation]. -   S and C agree on number of links required for data transfer -   C pulls data from S memory. -   S can close connection in the following ways—when all data is sent     (size known) & when job is complete. -   S closes connection by shoulder tap to C

In FIG. 3, server 12 (“S”) and processor 20 (“C”) share access to staging storage device 36. If S needs to transfer dataset D to C then the following steps must happen—(i) S must read D and (ii) transfer D to C over link L. Instead, S can inform C of the name of the dataset and C can read this dataset directly from 36. This possible because S and C share staging device 36. The steps required for this are listed as follows (again, for brevity “S” and “C” are used to refer to server 12 and processor 20, respectively)

Step 1—S provides dataset name & location (dataset descriptor) along control path to C. This serves as “shoulder tap”. C receives this information by polling for data, “pushed” from S.

Step 2—C reads data from D using dataset descriptor.

Step 1—Push or pull implementation possible.

Step 2—Pull or push implementation possible.

Step 1 (push)—“Control Path”

-   -   S shoulder taps (writes to) C with dataset name & location (if         known).

Step 1 (pull)—“Control Path”

-   -   S shoulder taps C with data block size (if known).     -   C pulls data from S memory.

Step 2 (Pull form)—“Data path”

-   -   Staging storage device 36 stores table with dataset name and         dataset block locations.     -   C makes read request to staging storage device 36 with dataset         name D.     -   staging storage device 36 provides a list of blocks.     -   C reads blocks from staging storage device 36     -   C encounters end of dataset.     -   C closes connection.

Step 2 (push form)—“Data Path”

-   -   Staging storage device 36 stores table with dataset name and         dataset block locations.     -   C makes read request to staging storage device 36 with dataset         name D.     -   storage controller of staging storage device 36 pushes disk         blocks of D directly into memory of C.     -   D closes connection.

The foregoing description of various aspects of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and obviously, many modifications and variations are possible. Such modifications and variations that may be apparent to a person skilled in the art are intended to be included within the scope of the invention as defined by the accompanying claims. 

1. A server-processor hybrid system for processing data, comprising: a set of front-end servers for receiving the data from an external source; a set of back-end application optimized processors for receiving the data from the set of front-end servers, for processing the data, and for returning processed data to the set of front-end servers; and an interface within at least one of the set of front-end servers having a set of network interconnects, the interface connecting the set of front-end servers with the set of back-end application optimized processors.
 2. The front-end server-processor hybrid system of claim 1, the interface being an input/output (I/O) cage.
 3. The front-end server-processor hybrid system of claim 1, the interface being embodied each of the set of front-end servers.
 4. The front-end server-processor hybrid system of claim 1, each of the set of back-end application optimized processors comprising: a power processing element (PPE); an element interconnect bus (EIB) coupled to the PPE; and a set of special purpose engines (SPEs) coupled to the EIB.
 5. The front-end server-processor hybrid system of claim 4, the set of SPEs being configured to process the data.
 6. The front-end server-processor hybrid system of claim 1, further comprising a web content server, portal, an application, an application accelerator and a database accelerator.
 7. The front-end server-processor hybrid system of claim 1, further comprising: a staging storage device; and a processed data storage device.
 8. A method for processing data, comprising: receiving the data from an external source on a front-end server; sending the data from the front-end server to a back-end application optimized processor via an interface having a set of network interconnects, the interface being embodied in the front-end server; processing the data on the back-end application optimized processor to yield processed data; and receiving the processed data from the back-end application optimized processor on the front-end server.
 9. The method of claim 8, the interface being an input/output (I/O) cage.
 10. The method of claim 8, the back-end application optimized processor comprising: a power processing element (PPE); an element interconnect bus (EIB) coupled to the PPE; and a set of special purpose engines (SPEs) coupled to the EIB.
 11. The method of claim 8, the set of SPEs being configured to process the data.
 12. The method of claim 8, further comprising a web content server, portal, an application, an application accelerator and a database accelerator.
 13. A method for deploying a server-processor hybrid system for processing data, comprising: providing a computer infrastructure being operable to: receive the data from an external source on a front-end server; send the data from the front-end server to a back-end application optimized processor via an interface having a set of network interconnects, the interface being embodied in server; process the data on the back-end application optimized processor to yield processed data; and receive the processed data from the back-end application optimized processor on the front-end server.
 14. The method of claim 13, the interface being an input/output (I/O) cage.
 15. The method of claim 13, the interface being embodied in at least one of the set of front-end servers.
 16. The method of claim 13, the back-end application optimized processor comprising: a power processing element (PPE); an element interconnect bus (EIB) coupled to the PPE; and a set of special purpose engines (SPEs) coupled to the EIB.
 17. The method of claim 14, the set of SPEs being configured to process the data.
 18. The method of claim 13, further comprising: a staging storage device; and a processed data storage device.
 19. The method of claim 13, further comprising a web content server, portal, an application, an application accelerator and a database accelerator. 